Answering Structured Queries on Unstructured Data
نویسندگان
چکیده
There is growing number of applications that require access to both structured and unstructured data. Such collections of data have been referred to as dataspaces, and Dataspace Support Platforms (DSSPs) were proposed to offer several services over dataspaces, including search and query, source discovery and categorization, indexing and some forms of recovery. One of the key services of a DSSP is to provide seamless querying on the structured and unstructured data. Querying each kind of data in isolation has been the main subject of study for the fields of databases and information retrieval. Recently the database community has studied the problem of answering keyword queries on structured data such as relational data or XML data. The only combination that has not been fully explored is answering structured queries on unstructured data. This paper explores an approach in which we carefully construct a keyword query from a given structured query, and submit the query to the underlying engine (e.g., a web-search engine) for querying unstructured data. We take the first step towards extracting keywords from structured queries even without domain knowledge and propose several directions we can explore to improve keyword extraction when domain knowledge exists. The experimental results show that our algorithm works fairly well for a large number of datasets from various domains.
منابع مشابه
Do it your own (DIY) Jeopardy Question Answering System
The evolution and maturity of semantic technologies techniques and frameworks are bringing functionalities which were once considered academic or prototypical into real-life applications. Products such as IBM Watson [1] and Siri are examples of applications which are heavily leveraged on state-of-the-art semantic technologies. These systems provide a synthesis of the functionalities which are a...
متن کاملAnswering Boolean Hybrid Questions with HAWK
The decentral architecture behind the Web has led to pieces of information being distributed across data sources with varying structure. Hence, answering complex questions often requires combining information from structured and unstructured data sources. We present an extension for HAWK, a novel search approach for Hybrid Question Answering based on combining Linked Data and textual data. Espe...
متن کاملHAWK - Hybrid Question Answering Using Linked Data
The decentral architecture behind the Web has led to pieces of information being distributed across data sources with varying structure. Hence, answering complex questions often required combining information from structured and unstructured data sources. We present HAWK, a novel entity search approach for Hybrid Question Answering based on combining Linked Data and textual data. The approach u...
متن کاملA Comparison of Search Engine Technologies for a Clinical Data Warehouse
A clinical data warehouse (DW) can be used to recruit patients for clinical studies or statistical analysis. For improved user experience, it is crucial that the search engine technology of the DW answers user queries quickly. In this paper, we investigate the performance of the two most popular technologies for regarding structured and unstructured data query answering: a database and a search...
متن کاملInformation Theoretic Retrieval with Structured Queries and Documents
Information retrieval through statistical language modeling has become popular thanks to its firm theoretical background and good retrieval performance. One goal of current research on structured information retrieval is thus to extend such models to take advantage of structure information. As a structure may be present on documents or queries or both, we are interested in supporting not only u...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006